Sequence Alignment Tools: One Parallel Pattern to Rule Them All?

نویسندگان

  • Claudia Misale
  • Giulio Ferrero
  • Massimo Torquati
  • Marco Aldinucci
چکیده

In this paper, we advocate high-level programming methodology for next generation sequencers (NGS) alignment tools for both productivity and absolute performance. We analyse the problem of parallel alignment and review the parallelisation strategies of the most popular alignment tools, which can all be abstracted to a single parallel paradigm. We compare these tools to their porting onto the FastFlow pattern-based programming framework, which provides programmers with high-level parallel patterns. By using a high-level approach, programmers are liberated from all complex aspects of parallel programming, such as synchronisation protocols, and task scheduling, gaining more possibility for seamless performance tuning. In this work, we show some use cases in which, by using a high-level approach for parallelising NGS tools, it is possible to obtain comparable or even better absolute performance for all used datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Set of Annotation Interfaces for Alignment of Parallel Corpora

Annotation interfaces for parallel corpora which fit in well with other tools can be very useful. We describe a set of annotation interfaces which fulfill this criterion. This set includes a sentence alignment interface, two different word or word group alignment interfaces and an initial version of a parallel syntactic annotation alignment interface. These tools can be used for manual alignmen...

متن کامل

An Overview of Multiple Sequence Alignment Parallel Tools

Multiple sequence alignment is a key problem to most bioinformatics applications. The last ten years have witnessed a big improvement to existing multiple alignment tools and the development of new ones. Various parallel architectures have been experimented for reaching the highest level of accuracy and speed. This paper surveys most popular tools to clarify how parallelism accelerates the proc...

متن کامل

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

متن کامل

Alignment Tools for Parallel Treebanks

This paper reports about our efforts in creating a tri-lingual parallel treebank. The focal points are consistency checking and all aspects of sub-sentential alignment. We discuss the alignment guidelines, the importance of quality checks, and special alignment problems. Then we look at alignment algorithms and alignment visualization tools and we compare our own TreeAligner with other alignmen...

متن کامل

BioGAT: a Grid Toolkit for Bioinformatics Sequence Alignment

Even though there are many useful tools for sequence alignment such as Blast by NCBI, Fasta by the University of Virginia and Smith-Waterman, by NIST, one of the key issues is that sequence databases are exploding in size, growing at an exponential rate. Grid and parallel computing are crucial techniques to maintain and improve the effectiveness of sequence comparison tools, whereas the Web Ser...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2014  شماره 

صفحات  -

تاریخ انتشار 2014